Corpus: ita_news_2007_30K


4.4.1.5 Number of Word-N-grams at Sentence Endings

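Number of distinct word N-grams (N = 1 to 5) found at sentence endings in the first K sentences. The rows for K = 100000 and K = 1000000 are identical, presumably because the corpus (30K) contains fewer than 100000 sentences.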

K         # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100               96             96              97             97             98
1000             902            974             990            993            995
10000           6860           9118            9814           9950           9983
100000         15952          25011           28876          29686          29884
1000000        15952          25011           28876          29686          29884
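
Counts like those in the table could be reproduced roughly as follows. This is a minimal sketch, not the tool that generated this page: the function name count_ending_ngrams, the whitespace tokenization, and the sample sentences are all assumptions made for illustration, and sentence-final punctuation is assumed to have been removed already.

```python
def count_ending_ngrams(sentences, k, max_n=5):
    """Count distinct word n-grams (n = 1..max_n) that end the first k sentences."""
    seen = {n: set() for n in range(1, max_n + 1)}
    for sentence in sentences[:k]:
        tokens = sentence.split()  # assumption: plain whitespace tokenization
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # skip sentences shorter than n words
                seen[n].add(tuple(tokens[-n:]))
    return {n: len(ngrams) for n, ngrams in seen.items()}


# Hypothetical sample sentences, not taken from the corpus.
sample = [
    "il governo ha approvato la legge",
    "il senato ha approvato la legge",
    "la camera ha respinto la riforma",
]
counts = count_ending_ngrams(sample, k=3)
print(counts)  # {1: 2, 2: 2, 3: 2, 4: 2, 5: 3}
```

Because every distinct ending of length n+1 extends an ending of length n, the counts can only grow or stay equal from left to right, which matches each row of the table.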


[Figure: Zipf's diagram for sentence endings (Gnuplot diagram, image not included)]
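
The data behind a Zipf diagram is a rank-frequency list: the sentence-ending words sorted by how often they occur, paired with their rank. The sketch below shows one way to build such a list. The function name ending_rank_frequency, the whitespace tokenization, and the sample sentences are assumptions for illustration, not the procedure used for the original diagram.

```python
from collections import Counter


def ending_rank_frequency(sentences):
    """Return (rank, word, frequency) triples for sentence-final words."""
    endings = Counter()
    for sentence in sentences:
        tokens = sentence.split()  # assumption: plain whitespace tokenization
        if tokens:
            endings[tokens[-1]] += 1
    # most_common() sorts by descending frequency; rank 1 is the most frequent.
    return [
        (rank, word, freq)
        for rank, (word, freq) in enumerate(endings.most_common(), start=1)
    ]


# Hypothetical sample sentences, not taken from the corpus.
sample_sentences = [
    "il governo ha approvato la legge",
    "il senato ha approvato la legge",
    "la camera ha respinto la riforma",
]
ranking = ending_rank_frequency(sample_sentences)
print(ranking)  # [(1, 'legge', 2), (2, 'riforma', 1)]
```

Plotting frequency against rank on doubly logarithmic axes gives the usual Zipf diagram; under Zipf's law the points fall close to a straight line.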

2071 msec needed at 2018-03-10 23:09